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Abstract —Using a very low-resolution analog-to-digital con¬ 
vertor (ADC) unit at each antenna can remarkably reduce the 
hardware cost and power consumption of a massive multiple- 
input multiple-output (MIMO) system. However, such a pure 
low-resolution ADC architecture also complicates parameter 
estimation problems such as time/frequency synchronization and 
channel estimation. A mixed-ADC architecture, where most of 
the antennas are equipped with low-precision ADCs while a 
few antennas have full-precision ADCs, can solve these issues 
and actualize the potential of the pure low-resolution ADC 
architecture. In this paper, we present a unified framework to 
develop a family of detectors over the massive MIMO uplink 
system with the mixed-ADC receiver architecture by exploiting 
probabilistic Bayesian inference. As a basic setup, an optimal 
detector is developed to provide a minimum mean-squared- 
error (MMSE) estimate on data symbols. Considering the highly 
nonlinear steps involved in the quantization process, we also 
investigate the potential for complexity reduction on the optimal 
detector by postulating the common pseudo-quantization noise 
(PQN) model. In particular, we provide asymptotic performance 
expressions including the MSE and bit error rate for the optimal 
and suboptimal MIMO detectors. The asymptotic performance 
expressions can be evaluated quickly and efficiently; thus, they 
are useful in system design optimization. We show that in the 
low signal-to-noise ratio (SNR) regime, the distortion caused by 
the PQN model can be ignored, whereas in the high-SNR regime, 
such distortion may cause 1-bit detection performance loss. The 
performance gap resulting from the PQN model can be narrowed 
by a small fraction of high-precision ADCs in the mixed-ADC 
architecture. 

Index Terms —Massive MIMO, MIMO detector, low-resolution 
ADC, mixed architecture, Bayesian Inference. 


1. Introduction 

Massive multiple-input multiple-output (MIMO) systems 
are widely regarded as a disruptive technology for next- 
generation (i.e., 5G) communication systems [l]-[3]. By 
equipping a base station (BS) with an unprecedented number 
of antennas (a few hundreds or a thousand) in a centralized 
[4], [5] or distributed [6] fashion, such a system can reduce 
cell interference substantially through the simplest signal pro¬ 
cessing method because the channel vector between the users 
and the BS becomes quasi-orthogonal. 

However, a large number of antennas significantly compli¬ 
cate the design of hardware for the implementation of massive 
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MIMO in production. In particular, such systems require 
an analog-to-digital converter (ADC) unit for each receiver 
antenna; therefore, using many antennas results in a need for 
an equivalent number of ADCs. The exponential increase in 
cost and power consumption attributed to high-speed and high- 
resolution ADCs* is a major bottleneck in deploying massive 
MIMO systems [8]. A solution to this problem involves using 
a very low-resolution ADC (e.g., 1-3 bit) unit at each radio 
frequency chain [9]-[14]. 

Previous work; Low-resolution ADCs have favorable prop¬ 
erties such as reduced circuit complexity, low power consump¬ 
tion, and feasible implementability. However, these converters 
inevitably deteriorate performance and complicate receiver 
design. The effect of low-resolution ADCs on channel capacity 
has been studied for single-input single-output (SISO) chan¬ 
nels [15] and, recently, MIMO channels [12]-[14]. Through a 
single antenna with 1-bit quantization at the receiver, channel 
capacity can be achieved by QPSK signaling [15]. In 1- 
bit MIMO cases, however, high-order constellations can be 
used to generate rates higher than those produced with QPSK 
signaling [13], [14]. In addition to information-theoretical 
studies, other works related to estimation/detection based on 
low-resolution samples include time/frequency synchroniza¬ 
tion [16], channel estimation [17]-[20], data detection [9], 
[19]-[24], and related performance analysis [25], [26]. 

The strongly nonlinear characteristic of the quantization 
process complicates the precise estimate of continuous vari¬ 
ables. For example, a very long training sequence (requiring 
over 50 times the number of users) is necessary to achieve the 
same performance as the perfect CSI case in a MIMO system 
with 1-bit ADCs [9]. Such pilot overhead cannot be sustained 
by a practical system; to reduce pilot cost, a joint channel- 
and-data estimation method that used predicated payload data 
to aid channel estimation was proposed in [19]. However, 
this technique enhances the computational complexity of the 
receiver. Other practical issues, such as automatic gain control 
(AGC) and time/frequency synchronization, have not been 
thoroughly examined in receivers with the pure low-resolution 
ADC architecture. 

Motivated by the aforementioned considerations, we present 
a mixed-ADC receiver architecture named mixed-ADC mas¬ 
sive MIMO system in the current study. In this architecture, 
most antennas were installed with low-resolution ADCs while 
a few antennas were equipped with full-resolution ADCs. A 
special case of the mixed-ADC massive MIMO system was 

'High-speed ADCs with a resolution of k hits typically adopt a flash 
architecture where the input voltage is compared with each of the 2'^ tap 
voltages simultaneously [7], 
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initially proposed by [27]; in this scenario, 1-bit ADCs replace 
the low-resolution ADCs. Under the mixed-ADC framework, 
CSI can be obtained in a round-robin manner [27] in which 
high-resolution ADCs are connected to different antennas at 
various symbol times to estimate the corresponding channel 
coefficients. In the process, good-quality CSI is available at 
the receiver without significant pilots overhead. The available 
high-resolution chains can also assist in estimating other pa¬ 
rameters and thus facilitate the establishment of several front- 
end designs, such as AGC and time/frequency synchronization. 
From an economic perspective, the mixed-ADC architecture is 
promising because it can be implemented by adding antennas 
with low-resolution ADCs to the existing BS. 

Contributions; In this study, we aim to examine mixed- 
ADC massive MIMO systems from a practical engineering 
perspective. In contrast to [27], which emphasizes the analysis 
of mutual information, our approach focuses on the MIMO 
(or multiuser) detection problem at the receiver. An exten¬ 
sive review of the large family of various MIMO detection 
algorithms is provided in [28], and MIMO detectors based on 
quantized samples are studied in [19]-[22], [24], [25]. To the 
best of our knowledge, no research has shed light on MIMO 
detection problems in mixed-ADC architecture, the design and 
performance of which are in fact a central concern related 
to this architecture. This paper makes the following specific 
contributions: 

• By exploiting probabilistic Bayesian inference, we pro¬ 
vide a unified framework to develop a family of MIMO 
detectors. This framework is labeled as the (generic) 
Bayes detector. To compute the Bayesian estimate, we 
must establish a prior distribution for the transmitted 
data and a likelihood function for the statistical model 
of receiver signals. Upon adopting the true prior and 
likelihood functions, the Bayes detector can achieve the 
best estimate in the mean squared error (MSB) sense. 
Properly postulating mismatched prior and likelihood 
functions can yield many low-complexity and popular 
detectors, such as the linear minimum mean-squared- 
error (LMMSE) and maximal-ratio-combining (MRC) 
receivers. 

• The exact expression for the likelihood of a receiver sig¬ 
nal with quantization is complex given the highly nonlin¬ 
ear property of the quantizer. The “mixture” architecture 
complicates the design of the Bayes detector further. A 
natural question is the following: how close to the best 
performance can a conventional MIMO detector operate 
without considering the exact (while annoying) nonlinear 
effect of the quantizers? To answer this question, we 
adopt a traditional heuristic that treats quantization noise 
as additive and independent. This heuristic is known as 
the pseudo-quantization noise (PQN) model [29, Chapter 
4]. By postulating a mismatched likelihood using this 
model in the Bayes detector, we can reduce compu¬ 
tational cost significantly while degrading performance 
only slightly. 

• To achieve the Bayes detector, we employ a recently 
developed technique called generalized approximate mes- 
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Fig. 1. (a) A mixed-ADC massive MIMO architecture, (b) A 3-bit uniform 

midrise quantizer, (c) The (generic) Bayes detector. 


sage passing (GAMP) algorithm [30]. We adapt this 
approximation for the mixed-ADC architecture by spec¬ 
ifying the corresponding adjustment in nonlinear steps. 
By applying the central-limit theorem (CUT) to the large 
system limit, we derive an approximate analytical expres¬ 
sion for the state evolution (SB) of the Bayes detector. 
A series of metrics, including bit error rate (BBR) and 
MSB, can be predicted, and computer simulations are 
conducted to verify the accuracy of our analysis. The 
performance of the mixed-ADC massive MIMO receivers 
can be quickly and efficiently evaluated. Several useful 
observations are made based on the analysis to optimize 
the receiver design. 

Notation: Throughout this paper, vectors and matrices are 
presented in bold typeface, e.g., x and X, respectively, while 
scalars are presented in regular typeface, e.g., x. We use 
and X* to respectively represent the transpose and conjugate 
transpose of a matrix X. Re(-) and Im( ) respectively denote 
the real and imaginary parts of a complex matrix (vector). 
Normal distributions with mean p, and variance cr^ are denoted 
by while CJf{p,a^) indicates a complex Gaussian 

distribution. Specihcally, JV{z; p,a‘^) [or CAf{z; p,a^)] de¬ 
notes the probability density function (pdf) of a Gaussian 
random variable z with mean p and variance cr^. Binally, let 
4>(a;) = Df with Df = dt. 

II. System Description 

We consider an uplink multiuser MIMO system that has K 
single-antenna users and one BS equipped with an array of N 
antennas. The discrete time complex baseband received signal 
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y € is given by 

y = ^=Hx + n = Hx + n, (1) 

V K 

where x = [xj] € contains the transmitted symbols from 
all the users, n = denotes the additive white Gaus¬ 
sian noise (AWGN), H = [Hij] € represents the chan¬ 

nel matrix between the BS and the K users, and we further 
define H = The entries of H are independent and 

identically distributed (i.i.d.) random variables with Hij ~ 
CAf{0, 1), the entries of x are i.i.d. and distributed as P]„{x), 
and the entries of n are i.i.d. CJV{0,a‘^). If we suppose that 
then the signal-to-noise ratio (SNR) of the 
system is defined by SNR = 1/cr^. 

The real and imaginary components of the received signal 
at each antenna are quantized separately by an ADC. The 
quantization bits are set differently for different antennas in 
the mixed-ADC massive MIMO system. All the N complex 
valued quantizers are abstractly described as Qc(-) so that the 
quantized signal can be written as 

i’=Qc(y), (2) 

where Qc(-) is applied element-wise and defined such that 
n = Qc{y^) = Q(R.e(yj)) -t-jQ(Im(yj)). If a statement is for 
a generic antenna index, we often omit the subscript index i 
from ri for brevity. 

In this paper, we mainly focus on uniform midrise quan¬ 
tizers with quantization step size A. Such a quantizer maps a 
real-valued^ input that falls in [r — ^ ,r + to value r from 
the discrete set 

f / 1 N 2” 2” 1 

^""{("2+^)^’^ = + ’Y/’ 

where k is the quantization bits. Figure 1(b) shows a graphical 
depiction of a 3-bit uniform midrise quantizer. Notice that an 
input of magnitude larger than — 1)A saturate. For ease of 
explication, we simply express the lower and upper thresholds 
associated with r as and r“P, respectively; specifically, 
they are 


^low _ 1 

' r- ^ 

for r > — (^ 

-1)A, 

(4a) 

—oo, 

otherwise, 


r"P = < 

r T+#, 

for r < (^ - 

-1)A, 

(4b) 


l -oo, 

otherwise. 



We assume that most of the antennas adopt low-resolution 
ADCs (e.g., 1 — 3 bit) while only a few antennas adopt high- 
resolution ADCs (e.g., 12 bit). To distinguish the various 
ADCs, we use 17^ to indicate the collection of the antennas 
equipped with ADCs of k bits. Moreover, the cardinality of 
VIk, is Ak, and thus we have A„ = N. 

^We take the variance of the channel coefficient to be 1/K, to normalize 
the equivalent channel H = [Hij] row and ensure that the analog baseband 
is within a proper range, e.g., (—1, -1-1). This assumption is without loss of 
generality because in practice, a variable gain amplifier with automatic gain 
control is used before quantization. Moreover, the equivalent row normalized 
channel H simplifies the discussion in this study. 

^For ease of notation, we abuse r to denote each real channel although it 
should be specified as Re(r) or Im(r). 


Compared with the conventional massive MIMO system 
equipped with high-resolution ADCs, such a mixed-ADC 
architecture remarkably reduces the circuit cost and power 
consumption. The available high-resolution chains can assist 
in estimating certain channel parameters. For example, CSI 
can be obtained in a round-robin manner [27] in which 
high-resolution ADCs are connected to different antennas at 
various symbol times to estimate the corresponding channel 
coefficients. In the process, good-quality CSI is available at 
the receiver without significant pilots overhead. We assume 
that the realization of the channel is perfectly known by the 
BS; thus, we focus on the detection problem. 


III. MIMO Detectors 


To recover the multiuser signals x from the quantized mea¬ 
surement r, we adopt the Bayesian inference. The Bayesian 
inference starts with computing the posterior probability 
P(x|r, H) according to the Bayes rule 


P(x|r,H) 


Pout(r|x,H)Pin(x) 

P(r|H) 


( 5 ) 


where P(r|H) = dxPout(r|x, H)P(x) is the marginal 
likelihood function, Pout(r|x,H) is a likelihood function, and 
Pin (x) is a measure of the input distribution Pin(x). Here, 
P (including Pin and Pout) indicates that the probability is 
different from the actual one, P. 

Using the posterior probability, the Bayes estimate of the 
unknown vector x is the mean of the posterior distribution 
and its element is expressed as 


Xj = E{xj|r, H}, 


(6) 


where the expectation over Xj is w.r.t. the marginal posterior 
probability 


P(x,|r,H)= f dxP(x|r,H). (7) 

7x\a;j 

The notation f , dx denotes the integration over all the 
variables in x except for Xj. 

Note that all our developed MIMO “detectors” are centred 
around the framework of (6), which should be classified as an 
estimation problem rather than a detection problem. However, 
transmitted signal Xi is typically formed by a constellation 
modulation scheme such as quadrature amplitude modulation 
(QAM). In such cases, the final constellation point can be 
determined from Xj via the maximum-likelihood decision 
rule. Therefore, with an abuse of terminology, we call the 
framework for obtaining (6) the (generic) Bayes “detector”. 

According to (5), the posterior P(x|r, H) is determined by 
Pout(r|x, H) and Pin(x). Postulating a mismatched measure 
P 7^ P is due to a particular choice that can reduce com¬ 
putational complexity [31]. We discuss these properties in the 
subsequent subsections. With reference the Bayes detector, we 
always mean (6) with general Pojt and P\n. 
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A. De-Quantization (DQ)-optimal Detector 

We begin with the detector development from the true Pout 
and Pin, i.e, no mismatch between P and P. Let Wi = h*x 
with h* be the i-th row of H. According to (1) and (2), the 
likelihood function (or the conditional probability distribution) 
of the quantized output given x is expressed as 

N 

Pout(r|x, H) = ]^Pout(Re{ri}|Re{'u;J) 

i^l 

X Poot(Im{ri}|Im{'u;j}). (8) 

where 



Using the true likelihood function (8) in conjunction with the 
true prior distribution Pin to (5), we obtain the exact posterior 
probability P(x|r, H). In this case, Xj via (6) can achieve the 
best estimates in terms of MSB. We refer to this detector as 
the de-quantization (DQ)-optimal detector. 

However, the DQ-optimal detector is not computation¬ 
ally tractable because the marginal posterior probability 
P(a;jjr, H) in (7) involves a high-dimensional integral. The 
canonical solution to this problem is using belief propagation 
(BP) by expressing the posterior probability P(x|r,H) as a 
factor graph [32]. Unfortunately, the complexity of BP is still 
too high for practical application. To solve this problem, BP 
was recently simplified as the GAMP algorithm [30] by using 
the second-order approximations at measurement nodes. We 
resort to the GAMP algorithm as an iterative procedure to 
solve the marginal posterior probability in a recursive way. 
Algorithm 1 provides a high-level description of GAMP to 
perform the (generic) Bayes detector under the mixed-ADC 
architecture. Note that Algorithm 1 can be used not only to 
achieve the DQ-optimal detector but also to operate other 
detectors introduced subsequently. 


Algorithm 1: GAMP Algorithm 

input : The received quantized signal, r; 

The channel state information, H; _ 

The prior distribution of the input signal, Pin(x); 
The likelihood function, Pout(^l^t^); 

output : 

definition: g{p,Vp) = ^ log dwPout{r\w)e 

initialize : = 0, vj = 1, p° = 0; 

for t = 1, • ■ • ,tmax do 
Output Step: 

1 v‘ = |h|M-'; 

2 p* = Hx* ^ — Vp O gi(p* 

3 v*=g'(p*,vp; 

4 z* =g(p*,vj,); 

Input Step: 

5 V* = [(|H|2)rv*]-i; 

6 s‘ = x‘-i-Hv*o(H*z‘); 

7 V* = Var{x|s*,v*}; 

8 X* = E{x|s*, V*}; 



Here, t and fmax represent the current iteration index 
and the maximum iteration times, respectively. We define 
|Ap = [|Ay p], and o denotes the Hadamard product; i.e., 
AoB = [AijBij], Note that g{-), g'i'), E{-}, and Var{-} 
are applied element-wise. To better understand the algorithm, 
we provide some intuition on each step of Algorithm 1. 
Lines 1-2 compute an estimate p of the product Hx and the 
corresponding variance Vp. The first term of p is a plug-in 
estimate of Hx and the second term provides a refinement by 
introducing Onsager correction in the context of AMP [33]. 
Using {p,Vp}, lines 3^ then compute the posterior estimate 
of the residual y — p and the inverse residual variances v^, 
where y is the posterior estimate of the un-quantized received 
signal by considering the likelihood function Pout- Lines 5-6 
then use these residual terms to compute s and v^, where s 
can be interpreted as an observation of x under an AWGN 
channel with zero mean and variance of v^. Finally, lines 7-8 
estimate the posterior mean x and variances by considering 
the prior 

Using the likelihood function of (9) to Algorithm 1, we can 
obtain analytic expressions of g and g\ which are 


where^ 


9iP,Vp) 

9'iP,Vp) 


r — p 


1 - 


Clow W CAf{w\p, Vp)dw 
/Cw CAf{w\p, al Vp)dw 
.2 /Cw |w - f\'^ CM{w\p, of -\-Vp)dw 

/Cw CM{w\p, 0-2 -f Upjdw 


(10a) 

(10b) 


(11a) 

(lib) 


Note that the integration interval r“P] in (11) varies with 

different antennas because the quantization bits are set differ¬ 
ently for different antennas in the mixed-ADC architecture. 
The variance Var{-} and expectation E{-} in Algorithm 1 are 
performed with respect to 


P{x\s\vl) 


CM{x\s*,vl)Pinix) 
f dx'CM(x'ls^,Vg)Pin(x') 


( 12 ) 


If the QAM constellation with M points is used, the prior is 
given by Pin (a;) = 1/M for x being the constellation points, 
and f dx' denotes the integral w.r.t. the discrete measure. 

Note that the calculations of gf), g'{), E{-}, and Var{-} for 
the DQ-optimal detector are highly nonlinear. Explicit expres¬ 
sions of the nonlinear functions are provided in [19]. Although 
the DQ-optimal detector can achieve the best estimate in terms 
of MSB, the highly nonlinear integration steps involved make 
the calculation difficult and in turn complicate the circuit. 
Moreover, in the mixed-ADC massive MIMO system, different 
antennas adopt ADCs with different resolutions, which makes 


■^Notice that ui is a complex-valued variable. The integral /Jiow dui in (11) 
is given by 

^Re(r'^P) /’lm(r^P) 


piAe(r p 

Jii 


Re(7>low) 7lm(rlow) 


dRe(Lu)dIm(Lu). 
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the quantization process even more complex. This inconve¬ 
nience motivates us to import mismatched measures in Pout 
and P\„ to simplify the calculation. 


B. Pseudo-De-Quantization (PDQ)-optimal Detector 

The nonlinear calculations of g{-) and g'{-) are due to the 
integration in (9). To simplify the calculations, one heuristic 
is to treat the nonlinear quantization process as an additive 
quantization error; i.e., 

r = Qc(Hx- f n) = Hx + n + q (13) 

with q = [qi\ = [Qc{yi) — Vi] being the quantization error. 
This heuristic is known as the PQN model [29]. 

In the PQN model, the quantization error qi and its input 
yi are generally assumed to be independent, and qi is usually 
modeled as a complex Gaussian random variable with zero 
mean and variance As a result, the measure Pout in (9) 
can be written as 

Poutir\w) = —T- ’ (14) 

with 7 = ( 7 ^ + cr^. By using Pout of (14) in Algorithm 1, we 
obtain 

T — p 

9{p,Vp) = -^, (15a) 

Up + 7 

g'ip,Vp) = —(15b) 

The highly nonlinear steps in (10) now becomes simple linear 
steps (15). Thus, the computational complexity is reduced 
significantly. We call Algorithm 1 with g and g' in (15) the 
pseudo-de-quantization(PDQ)-optimal detector. 

From the perspective of the algorithm itself, the PDQ- 
optimal detector is exactly the AMP algorithm [34] although 
adjusting 7 for each antenna is required to reflect different 
quantization bits for different antennas. Originally, the AMP 
algorithm was designed to recover a signal from a linearly 
transformed and additive Gaussian noise corrupted measure¬ 
ment. The resulting PDQ-optimal detector is induced from the 
GAMP algorithm by postulating the mismatched measure in 
Pout, which is mainly due to the low-complexity purpose. 


C. Linear Detector 

In the PDQ-optimal detector, g and g' are linearized by pos¬ 
tulating a mismatched measure in Pout while the calculation of 
{x, Vj;} in lines 7-8 of Algorithm 1 are still nonlinear. Next, 
we purpose a further complexity reduction on the nonlinear 
steps by imposing an additional mismatch in T]n- In fact, if we 
postulate that the input is Gaussian, i.e., P\n (x) = CN{x] 0,1), 
then the Bayes detector over the PQN model (13) becomes a 
linear detector. That is, Xj in ( 6 ) becomes 

x= (H*H-f 7 l)-^Hr, (16) 


^The variance of the quantization error on one side (real or imaginary) is 
usually modeled as equal to that of a uniform distribution on a quantization 
interval [—- 


A 
2 ’ 


<t2 = ^ 


12 ■ 


TABLE 1 

Special Cases eor the Generalized Bayes Estimator 


Method 

Pn 


^OUt 

distribution 

mismatch 

distribution 

mismatch 

DQ-Opt. 

Pin(x) 

No 

eq.(9) 

No 

PDQ-Opt. 

PnW 

No 

eq.(9) 

Yes 

Linear 

Gaussian 

Yes 

eq.(14) 

Yes 


where the parameter 7 is commonly called the regularization 
factor. In particular, when 7 is set to 0 and 00 , respectively, 
we immediately obtain the zero-forcing detector and the MRC 
detector [31]. 

The linear estimator (16) can also be implemented via 
Algorithm 1. To this end, we substitute Pin(a;) ^ CA/'(0,1) 
into (12), and then lines 7-8 of Algorithm 1 become 


V 


t 

Xj 


1 + 



(17) 


Now, the nonlinear steps of E{ } and Var{ } become simple 
linear steps (17). 

The computational complexity order for the dimensional 
matrix inversion in (16) is 0{NK‘^). When the linear estimator 
is implemented using Algorithm 1, the computational com¬ 
plexity order is 0{NKt^ax)- In fact. Algorithm 1 converges 
very quickly after 8-10 iterations. Thus, the computational 
complexity by using Algorithm 1 is smaller than that of the 
conventional linear detector (16) using the matrix inversion. 
In addition, the iteration steps in Algorithm 1 only involves 
matrix-vector multiplications that are highly suitable for hard¬ 
ware implementation. 


IV. Pereormance Analysis 

We have constructed numerous detectors that range from 
the optimal (but complex) detector to the suboptimal (but 
simple) MRC detector under a unified framework. All of 
these detectors are induced from the Bayesian inference by 
a particular choice of P]„ and Pout, the features of which are 
summarized in Table I. Remarkably, these detectors can be 
realized by the GAMP algorithm (i.e.. Algorithm 1); thus, their 
performance are amenable to asymptotic analysis. Specifically, 
the GAMP dynamics can be tracked by a set of state evolution 
(SE) equations [30]. 

In Section IV-A, we derive the SE equations for the generic 
Bayes detector in the mixed-ADC massive MIMO system. 
In Section IV-B, we particularize the SE equations for the 
three detectors constructed in Section III. Some performance 
comparisons among the three detectors are also provided in 
this subsection. 


A. State Evolution Analysis 

In this section, we derive the SE equations of Algorithm 1 
under the mixed-ADC massive MIMO system. Our derivation 
is performed in the large-system regime where K and N reach 
infinity while the ratios 

N/K —>■ A, Nf^/K —>■ Ak, Vk 


( 18 ) 
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remain fixed. 

We perform the analysis starting from the algorithm itself. 
Recall that output x* is deduced from line 7 of Algorithm 1. In 
lines 7-8 of Algorithm 1, the posterior averages are evaluated 
with respect to (12), which is determined by (s‘ , u* ). Thus, 
we need to determine the asymptotic behaviors for s* and its 
associated variance at iteration t. Injecting the quantities 
appearing in s‘ term in line 6 of Algorithm 1, we notice that s* 
can be represented as a sum of Xj and many other terms. Using 
the CUT, we can characterize Sj by its mean and variance. A 
proof is provided in Appendix A. The result is summarized 
by the following proposition; 

Proposition 1: Given Xj = x ^ P\rt{x), the asymptotic be¬ 
havior of Sj in Algorithm 1 can be characterized by 




(19) 


where z ^ CAf{0, 1). The parameters , E* are indepen¬ 

dent of index j and evolve as 


K 

= 5 ] A« 

K 

K 

Vx = E2;{|a:p} 

4 = e ...|/2 ^ 

= Ex,J a;*/i 


J DuT* I 7 

.r&'R.K, \ 


12 


0^ 


(20a) 


12 


if 12 


Vx ^x,z ’ 


/l 


E 

'G-R, \ 

J DuT* ^7 

) 

^ ■ X + \/A^z 

E* ’ ~E^j] 

■ X + s/Mz 1 
E* ’ ~EA 

■ x + sfMz 1 

Ip 


(20b) 


0 ' 


(20c) 

(20d) 

(20e) 

(20f) 

(20g) 



Fig. 2. The equivalent channel sees at lines 7-8 of Algorithm 1. 


□ 

Before verifying the accuracy of the analysis, we closely 
examine Proposition 1, which has several valuable features. 

(1) Analytical tractability. Proposition 1 reveals that the 
asymptotic behavior of s‘ in Algorithm 1 can be characterized 
by the scalar equivalent model (19). The scalar model (19) 
is independent of index j. This characteristic indicates that 
lines 7-8 of Algorithm 1 are decoupled into K uncoupled 
scalar operators like the posterior average is over the scalar 
equivalent channel (19) as shown in Figure 2. In the scalar 
equivalent channel, the Bayes detector postulates Pin{x) and 
P(s|a:) = CAf{s; x, -^) as the prior distribution and the like¬ 
lihood function, respectively. Note that the noise variance of 
the scalar channel is A while the Bayes detector adopts ^ 
as the noise variance because of the mismatched measure 
imported in Pout- Using the scalar equivalent channel, we can 
easily compute a large class performance metric for the Bayes 
detector such as MSE, BER, and mutual information. With 
regard to this performance metric, we are interested in the 
algorithm at time index imax — oo, and thus we omit the 
superscript index f^iax from the related parameters for brevity. 
We provide two examples as follows: 

• If Pin(x) is the QPSK signal, i.e., 

fl„(x)=li(x±P±jP), (23) 

then we can obtain the theoretical BER of the Bayes 
detector from (19). That is,® 


where = Ni^/K, x ^ P\,^{x), z ~ CA/'(0,1), and 


T'jrjT?) = $ 


y2r“P - 7? 


+ Vx - 








(xl+Vx- 


0(r|i?) = ( / dwPout(r|w)e 


t _.,t 


with vl/'(r-|7?) = ^3^, 0'(r|79) = and 


/i(s,4 = 


f dxxPin(x)e 


di9 


/ dx'Pin(P)e- 


|x'- 




f dx'Pin(x')e 


|2)*-|2 

(21a) 

(21b) 


(22a) 

(22b) 


BER = <3(^), (24) 

I ! where Q(z) = ;^= e~^ dt is the error function. 

• Eor an arbitrary modulation signal, /i(-) and /2(-) in 
(20d)-(20g) are the conditional mean and second moment 
estimates of x over the scalar channel (19) while postu¬ 
lating mismatched prior and likelihood functions. As a 
result, the MSE of the Bayes detector can be shown as 

MSE = E{|a: - = Vx - 2Re{7;2;£} -|- Vx- (25) 

(2) Generality. The analytical framework can apply to 
arbitrary postulated measures by substituting different prior 
distributions Pi„ into (22) and likelihood functions ^out into 

^For other high-order modulations, the closed-form of the BER expression 
can be derived based on the analysis in [35]. 
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(21b). Moreover, 'P(r|a;) given in (21a) characterizes the true 
quantization effect on an interval .^upj summa¬ 

tion J2ren (20a)-(20c) adds all the corresponding interval 
together. The quantization bins and the quantization level are 
conhgurable. Thus, the SE equations can incorporate arbi¬ 
trary quantization processes, such as nonuniform quantization. 
Moreover, in (20a)-(20c) indicates the effect of the 

mixed-ADC architecture. Interestingly, when particularizing 
our results to the case with a non-mixed ADC architecture, we 
recover the same asymptotic BER expression (24) as found in 
[26] with the replica method. 

(3) Computational simplicity. The analytical result is com¬ 
putationally simple because the corresponding parameters (i.e., 
A\D\E\c%vl~,vl) can be obtained in an iterative way, 
with each iteration only involving scalar summations (20a)- 
(20c) and scalar estimation computations (20d)-(20g). In fact, 
using (19), we can predict the SE in time of Algorithm 1 in 
the mixed-ADC massive MIMO system. Therefore, instead of 
performing time-consuming Monte Carlo simulations to obtain 
the corresponding performance metrics, we can predict the 
theoretical behavior by SE equations in a very short time. 

B. Explicit Expressions 

We now show special cases of the SE equations in Section 
IV-A. To present a relatively intuitive analytical result, all the 
expressions and discussions in this subsection are centered 
on a QPSK input distribution although our analysis can be 
incorporated with arbitrary input distributions. 

Eor the DQ-optimal detector, the postulated measures are 
the same as the true measures for both P]„ and Pout- Also, 
recall that the PDQ-qptimal detector only suffers a mismatched 
likelihood measure Pout in (14) and the linear detector further 
postulates the input as Gaussian. Substituting the correspond¬ 
ing measures into Proposition 1, we obtain the explicit SE 
expressions for the DQ-optimal detector, the PDQ-optimal 
detector, and the linear detector. The details of the expressions 
are listed in Table II. In particular, for the PDQ-optimal 
detector and the linear detector, the term in the logarithm of 
(21b) can be simplihed as 

/ dwPout{r\w)e = e . (27) 

As a result, the terms 0(j/|7?) and ©'(j/ji?) in (20a)-(20c) can 
be further simplihed as (26) shown in Table II. 

We now verify the accuracy of the analytical result pre¬ 
sented in Table II. Eirst, we evaluate the simplest non-mixed 
ADC case, i.e., the quantization function Qc(-) is the same for 
all antennas. In particular, we compare the BER expression 
(24) with that obtained by computer simulations. The simu¬ 
lations are conducted over 10,000 channel realizations where 
Algorithm 1 is performed with the maximum iteration times 
fmax = 20. We set the parameters as N = 200 and K = 50. 
In addition, we adopt a hxed step size A = 0.5. 

As shown in Eigure 3, the BER results for various detectors 
constructed in Section III under different quantization levels 
are compared. The markers denote the numerical results while 
the dashed curve characterizes the analytical behaviors. The 



Fig. 3. Comparison of analytical predictions against Monte Carlo simulations 
for various detectors. N = 200, K = 50, and A = 0.5 with a QPSK input. 
Dashed curve with color: theoretical prediction. Markers: simulation result. 
Solid curve: ultimate performance limit achieved by the DQ-optimal detector 
when the receiver has infinite precision to the received signal. 


hgure clearly shows precise predictions by our analytical 
expressions. Actually, we have conducted extensive simu¬ 
lation experiments to verify the accuracy of the analytical 
expressions with respect to the BER (24) and the MSE (25) 
even including the mixed-ADC architecture. All of these 
examination plots are omitted because of space limitations. 

Moreover, we observe that with the increase of quantization 
precision, all detectors generally achieve a signihcant perfor¬ 
mance improvement. Eor example, in the 1-bit quantization 
case, the PDQ-optimal detector and the linear estimators suffer 
from an error floor and almost do not work. However, in the 3- 
bit quantization case, the AMP estimator achieves an amazing 
performance gain; the gap between the DQ-optimal and PDQ- 
optimal detectors is extremely small. Even the linear detector 
only incurs a loss of 5.63 — 4.41 = 1.21dB to that attained 
by the DQ-optimal detector when BER=10“^. Recall that the 
PDQ-optimal detector drops out the integration parts in (9) to 
reduce the computation cost. Eigure 3 shows that the effect of 
this “dropping” weakens with the increase of the resolution 
level. When the resolution goes to inhnity, the integration 
effect is eventually eliminated, giving the PDQ-optimal and 
DQ-optimal detectors a common ultimate performance limit 
as predicted by the solid curve in Eigure 3. Therefore, instead 
of turning to the more sophisticated DQ-optimal detector, 
the PDQ-optimal detector with low precision (e.g., 3-bit) 
ADCs seems sufficient to achieve near optimal performance 
(4.41 — 3.83 = 0.58 dB loss at BER=10“^ is acceptable) as 
the optimal unquantized MIMO detector. 

Einally, we point out an interesting result from Eigure 3. 
Theoretically, the linear detector should have poorer perfor¬ 
mance than the PDQ-optimal detector because the linear detec¬ 
tor further imposes a mismatch in P,„. Contrary to our belief, 
this inference seems to be un-true in the 1-bit quantization 
case. The reason, as shown in the following section, is that the 










TABLE II 

Explicit Expressions for Generalized Bayes Estimators for QPSK constellation signals 
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Method 

SE parameters 

DQ-Opt. 

— Vx — 1 

Vx = Vxx = J Dutanh(A/]4tt + D) 

a = Va. V 

/A J ^/(r L/F^m) 

K V IV a; y 

D = E = A 

PDQ-Opt. 

— C-x — ^ 

Vxx = J Dutanh{V^u + D) 

Vx = j" Dutanh^{y/Au + D) 

A, D, and E are given in (20a)-(20c) 
without the superscript iteration index 
t, and the terms 0(t/|F) and 0'(y|i?) 
can be simplified as 

0(j/|i?) = , (26a) 

'y + Cx-Vx 

0'(j/|i?) = . (26b) 

'y + Cx-Vx 

Linear 

Vx = i 

A + , 1 

~ {1 +E)^ 1 +E 

D 

l + E 

A + D^ 

“ (1 + E)^ 


PDQ-optimal detector is sensitive to the step size A, which 
implies the importance of parameter setting in the MIMO 
detectors. 


V. Discussions 

In this section, we discuss optimizing the MIMO detectors 
via the analytical expressions presented in Table II. The issues 
are related to quantization-step sizes, the regularization factor 
and the mixed receiver architecture. 

A. Quantization Step Size 

We consider the I-bit quantization case. Following a param¬ 
eter setting similar to that shown in Figure 3, Figure 4 depicts 
the average BER versus the step sizes under 1-bit quantization 
with SNR=5dB. The performance of the 1-bit DQ-optimal 
detector is irrelevant to the value of step sizes. The reason is 
that the integrals in lines 3^ in Algorithm 1 [or, specifically, 
(10) and (11)] are taken w.r.t. either an interval (—00,0] or 
[0, -foo) and only the sign of the output is of interest. However, 
this property is not true for the other detectors. The perfor¬ 
mance of the PDQ-optimal detector and the linear detector 
are affected by the value of A although the former appears to 
be more sensitive. For example, when the step size is at the 
optimal value of 2.06, the PDQ-optimal detector achieves its 
best performance, which coincides with our inference that the 
performance of the PDQ-optimal detector is between that of 
the DQ-optimal detector and the linear detector. Recall that 
we choose the step size A = 0.5 in Figure 3 and the BER 
for the PDQ-optimal detector is 0.280. Thus, the PDQ-optimal 
detector does not work effectively in Eigure 3. 

Eor the multi-bit uniform quantization, the optimum step 
sizes that minimize the corresponding BER for a QPSK input 


signal are compared to the input bit resolutions in Table 
III. The corresponding step size values of the PDQ-optimal 
detector and the linear detectors are remarkably close for all 
bits because both detectors treat the quantization process as 
additive quantization error. This observation suggests that a 
prior mismatch may not cause a significant difference in terms 
of an optimized step size. 

Recall that the element of a ADC input yi is the sum over 
many independent terms and thus can be approximated as a 
Gaussian distributed variable with variance 1 Therefore, 
we normalize the optimal step sizes according to 


A 


norm 


a/2A 


(28) 


and average them over different SNRs. In this case, the step 
size is multiplied by \/2 because the signal power is -\/2 times 
the power of the real or imaginary part. The corresponding 
normalization results are shown in Table IV. The optimal 
step sizes that try to minimize the quantization distortion 
for a Gaussian input signal [36] are listed in Table IV for 
comparison. The value of Anorm is very close to those in 
[36] for all input bit rates k varying from 2 to 7. In fact, 
through extensive simulations under different antenna ratios, 
input constellations, and performance metrics (MSE or BER), 
we find that this property is general. Consequently, we can 
safely use the values in [36] as a consistent rule to assist the 
step size design. The rule of thumb does not apply to the 
1-bit quantization case or the DQ-optimal detector. However, 
for these special cases, we can quickly obtain the optimal step 
sizes based on the analytical expressions. 
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Fig. 4. BER versus step size under 1-bit quantization with A = 4, SNR = 
5 dB and a QPSK input. 


TABLE III 

Optimal step sizes for DQ-optimal/PDQ-optimal/linear 
DETECTORS (A = 4). 


TABLE IV 

Comparison the optimal step sizes for different purposes. 


K. 

^norm 

^norm 

^opt. 

(bits) 

(PDQ-Opt.) 

(Linear) 

from [36] 

2 

1.0826 

0.9619 

1.0080 

3 

0.6014 

0.5836 

0.5895 

4 

0.3390 

0.3345 

0.3360 

5 

0.1941 

0.1936 

0.1883 

6 

0.1042 

0.1037 

0.1041 

7 

0.0568 

0.0568 

0.0569 


transmitted inputs.^ From the results in Figures 5(a) and 5(b), 
we find that in a typical massive MIMO system (e.g., A = 4), 
the MSEs of the PDQ-optimal detector are not sensitive to 
the value of as long as its value is small enough, e.g., 
(Tq < 1. However, for the case when the number of antennas 
are compatible with that of the users (e.g., A = 1), we should 
be more careful in selecting the value of tr^. Comparing 
the triangle markers that correspond to the lowest MSE and 


the 


V 

markers determined by cr^ = 


find that 


approximating by the latter rule is generally good enough. 
Although not precise, this rule of thumb seems effective. 


SNR 

(dB) 

1 bit 

2 bit 

3 bit 

-5 

— /4.247/5.150 

1.990/1.461/1.438 

1.147/0.849/0.845 

-2.5 

— 73.245/3.595 

1.611/1.217/1.173 

0.932/0.697/0.689 

0 

^ 72.636/2.638 

1.338/1.058/0.992 

0.781/0.596/0.583 

2.5 

— 72.271/1.925 

1.132/0.954/0.874 

0.674/0.531/0.514 

5 

^ 72.057/1.519 

0.970/0.886/0.801 

0.594/0.489/0.475 

7.5 

— 71.934/1.277 

0.835/0.844/0.759 

0.531/0.464/0.448 

10 

— 71.865/1.131 

0.717/0.818/0.735 

0.474/0.449/0.432 

12.5 

— 71.825/1.043 

0.609/0.804/0.721 

0.419/0.440/0.423 

15 

— 71.803/0.990 

0.511/0.795/0.713 

0.366/0.436/0.419 

17.5 

— 71.790/0.959 

0.423/0.791/0.708 

0.315/0.433/0.416 

20 

— 71.783/0.942 

0.347/0.788/0.706 

0.267/0.431/0.415 

.SNR 


4 bit 

5 bit 

6 bit 

(dB) 

-5 

0.651/0.484/0.483 

0.365/0.272/0.271 

0.202/0.150/0.150 

-2.5 

0.530/0.396/0.394 

0.298/0.222/0.221 

0.165/0.123/0.122 

0 

0.447/0.337/0.333 

0.251/0.188/0.187 

0.145/0.104/0.103 

2.5 

0.389/0.298/0.295 

0.220/0.167/0.165 

0.123/0.092/0.092 

5 

0.349/0.274/0.272 

0.199/0.153/0.152 

0.111/0.085/0.084 

7.5 

0.318/0.260/0.256 

0.184/0.145/0.145 

0.104/0.080/0.079 

10 

0.292/0.251/0.247 

0.171/0.145/0.145 

0.098/0.077/0.077 

12.5 

0.267/0.246/0.245 

0.160/0.145/0.145 

0.092/0.076/0.075 

15 

0.241/0.245/0.240 

0.149/0.145/0.145 

0.087/0.075/0.075 

17.5 

0.216/0.245/0.239 

0.137/0.145/0.145 

0.082/0.074/0.074 

20 

0.190/0.245/0.238 

0.125/0.145/0.145 

0.077/0.074/0.074 


B. Regularization Factor 

In the PQN model, the nonlinear quantization process is 
treated as additive quantization error with variance cr^ in (14) 
to simplify computation. So far, we have used the variance of 
a uniform distribution on a quantization interval y] to 

model cr^, i.e., = ^. We now examine this approximation. 

Eigure 5 shows the MSEs of the PDQ-optimal detector with 
varying numbers of quantization bit depth k for Gaussian 


C. Mixed Architecture 

We have discussed the optimal parameter settings for the 
quantization step size and the regularization factor. Using these 
optimal parameters, we now investigate the performance of the 
mixed-ADC architecture. Our discussions focus on the follow¬ 
ing question: how close to the best performance can a MIMO 
detector operate without considering the exact nonlinear effect 
of the quantizers? In the following subsections, we refer to 
a system where most of the antennas adopt 1-bit resolution 
ADCs while the rest have full precision ADCs as the mixed 
1-bit architecture. We also have the mixed 2-bit architecture 
and others. 

1} QPSK input: Eigure 6 illustrates the BER versus the 
SNR under different mixed architectures for QPSK inputs. 
Clearly, the mixed architecture helps improve the performance. 
Eor example, for the mixed 1-bit architecture, merely installing 
5% high-resolution ADCs (i.e., ^ = 95%) eliminates the 
error floor caused by the distortion of the mismatched measure 
of the PDQ-optimal detector as well as the 1-bit quantization. 
If we further increase the fraction of the full-precision ADCs 
to 10% (i.e.,^ = 90%), the mixed PDQ-optimal detector 
can achieve similar performance as the pure 1-bit DQ-optimal 
detector. Also, in the mixed 2-bit architecture, an 80% 2-bit 
quantization mixed PDQ-optimal detector achieves a perfor¬ 
mance similar to the pure 2-bit DQ-optimal detector. Recall 
that the DQ-optimal detector is considered to achieve the 
best estimate in terms of minimizing the MSE. We conclude 
that the mixed architecture can help maintain the promised 
performance while significantly reducing the computational 
complexity by ignoring the exact but complex quantization 
process. In addition, comparing Eigures 6(a) and 6(b), we 

^In this case, the PDQ-optimal detector is exactly the linear detector because 
both the postulated and the actual inputs are Gaussian. 
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(a) SNR=0dB 



(b) SNR=20dB 

Fig. 5. MSE of PDQ-optimal detector versus under different quantization 
levels with a) SNR=0dB and b) SNR=20dB. The triangle markers correspond 

n A 2 

to the lowest MSE while the cross marks correspond to 


notice that the mixed architecture is more advantageous to the 
PDQ-optimal detector than the DQ-optimal detector, which 
will also be validated by Figure 8. 

2 ) Gaussian input: In such an input type, the PDQ-optimal 
detector is exactly the linear detector because both the postu¬ 
lated and the actual inputs are Gaussian. We start by providing 
insights into the effect of quantization based on the non-mixed 
architecture. Figure 7 illustrates the MSEs of the detectors ver¬ 
sus the antenna configuration ratios X = M/N for SNR=0 dB 
and SNR=20dB. As shown in Figure 7(a), when SNR=0dB, 
the performance degradation compared to the unquantized case 
caused by 1-bit quantization is approximately 3dB, indicating 
that ignoring the exact quantization effect is feasible in the 
low SNR regime. In the high SNR regime, however, a 1-bit 
resolution generally incurs significant performance losses as 
shown in Figure 7(b). In a typical massive MIMO system (e.g., 
A = 16), the PDQ-optimal detector generally causes 1-bit loss 
compared with the DQ-optimal detector. For example, the 2- 



(a) PDQ-optimal detector 



SNR (dB) 

(b) DQ-optimal detector 

Fig. 6. BER versus SNR under a mixed-ADC architecture for different 
detectors with QPSK inputs. 


bit PDQ-optimal detector has the same performance as the 1- 
bit DQ-optimal detector. Alternatively, the loss caused by the 
simplification of the quantization process can be compensated 
by doubling the number of receiver antennas. Figures 7(a) and 
7(b) shows that the MSEs of the detectors generally improve 
by 3-6 dB for each 1-bit rate increase. The higher the SNR is, 
the closer the MSE improvement is to 6 dB. 

In contrast to Eigure 7, which focuses on the non-mixed 
architecture, Eigure 8 illustrates the results under the mixed 
architecture. It can be seen that the mixed architecture can help 
narrow the performance gap resulting from the PQN model. 
Eor example, in a massive MIMO system with A > 10, the 
pure 1-bit PDQ-optimal detector incurs approximately 6dB 
loss compared with the pure 1-bit DQ-optimal detector. Their 
corresponding gaps are respectively reduced to 3 dB and 1 dB 
when 5% and 20% full precision ADCs are installed. 




























II 
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A 


(a) SNR=OdB 



Fig. 7. MSE for a Gaussian input signal versus MIMO configuration A = 
N/K with a) SNR^OdB and b) SNR=20dB. 


VI. Conclusion 

Using a unified framework, we specified three kinds of 
detectors for a mixed-ADC massive MIMO receiver by pos¬ 
tulating mismatched measures in the Bayes detector. The 
asymptotic performances of these detectors were analyzed 
by the SE equations and their accuracy were validated by 
Monte Carlo simulations. The SE equations can be quickly 
and efficiently evaluated; thus, they are useful in system design 
optimization. We provided useful observations to aid design 
optimization. In particular, the results showed that we can 
reduce the computational burden by treating the complex 
nonlinear quantization process as a PQN model. Ignoring such 
a complex quantization process may cause 1-bit performance 
degradation on the high-SNR regime but does not have a 
significant effect in the low-SNR regime. Moreover, the mixed 
receiver architecture can help narrow the performance gap 
resulting from the PQN model. 


1 2 4 8 16 32 64 128 256 

A 


Fig. 8. MSE for a Gaussian input signal versus MIMO configuration A = 
N/K m a mixed architecture with SNR=20dB. 


Appendix A 

Proof of Proposition 1 

In this appendix, we derive the SE equations of Algorithm 
1 in the large-system limit (i.e., N,K ^ oo) by following 
[25]. Eor conceptual clarity and ease of explanation, we restrict 
the derivation to a corresponding real-valued system although 
Proposition 1 is for the complex-valued case. In addition, for 
brevity, we omit all the iteration index t. 

We recall from lines 7-8 of Algorithm 1 that the conditional 
mean and variance of xj denoted by Xj and , respectively, 
are taken w.r.t. the marginal posterior ^{xj\sj,Vsj) given in 
(12), which is determined by (s*, n* ). Therefore, to obtain the 
SE equations, we have to determine the asymptotic behavior 
of Sj and Vg^. 

We begin with the analysis of Vg^. Let be the f-th row 
of h and Wi = h.fx. We notice that given a fixed hf, 

E{hfx} = ^ hijXj = m, (A.la) 

j 

Var{hfx} = ^ = v^., (A.lb) 

3 

where the expectation and variance are taken w.r.t. 
^{xj\sj,Vs .). According to the CLT in the large system limit, 
Wi can be regarded as a Gaussian random variable with mean 
Wi and variance va,.. Let 

= / dwPout{ri\w)e , (A.2) 

and we use S'(r|rt;) and 5"(r|?u) to denote the first-order 
and second-order derivatives of S(r|ty) with respect to w, 
respectively. Then, Vg^ in line 5 of Algorithm 1 is expressed 
as 

— = ^ Oy - ^ Pij (A.3) 

^Sj 

J 1 r 
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where 


'vin\wi) 


CXij — 


h- and /?.. - 


Letting 


Here and hereafter, we denote ^{x) = ^{x\sj,Vs-) for no¬ 
tation simplicity. We find that the covariance matrix becomes 
(A.4) asymptotically independent of the index i. Altogether, these 
provide the bivariate Gaussian distribution; 


A = E “u 

i 

and 

Pj ~ E 

i 

(A.5) 

Pw,w{w,W 

yields 

1 

= A,- 

-p,. 

(A.6) 

Now, we a 
P(r„,-u;,|H). 

Vs- 

Then, we explore the asymptotic (or the large system) behavior 

P(r„,i<),|H). 

P{ri^,Wi) as*^ 

of Aj and Bj. 

To this end, we find that when H is fixed. 


P{r^,w,) = j 

Aj E^r] 

i 

[_ 



(A.7a) 

=p{^kA j 

Bj -> E^rj 

i 

^ R. . 

H) 

— EBk, 

K 

(A.7b) 



- W,Vr. - — 


(A.IO) 


(Wi,Wi) 


where —> indicates the convergence to the asymptotic limit, 
and we define 


Ak — h|, 




= E4eKIh} 

3 

J x'^Pin{x)dx = Vx, 


= E4e{4ih} 


H 


J = Vx, 

EjwiWilH} = ^ hfjElxjXjl H} 


3 

E<! cc 


';,^(a;)da 


X \ > = Vxx- 


= P{'r^\,)M{w^\0,Vx) 

X f dwiAf 


(A. 8a) 
(A.8b) 


^XX -V '^XX 

- W^,Vx - 

Vi> Vi- 


P{xK\iW{wi\0,Vx)d>i n 


Vx 


dyN'iy\w^,al) 

(AM) 


Here, r„ represents the quantized signals that belong to 
discrete set TZ^. To compute the expectations in (A.8), we 
need the joint distribution P(rK,r()i|H) because and f3ij 
depend on two correlated variables and Wi. 

Our strategy to obtain P(rK,r()i|H) is via the marginal of 
P(r„,u)i, Wi|H) = P(r„,u)i|wi,H)P(wi,u)i). Therefore, we 
calculate the joint distribution P{wi,Wi) first. Both Wi and w* 
are sums over many independent terms. Therefore, according 
to the CLT, they are Gaussian random variables. Their means 
are zero because {hij} has zero mean. The entries of the 
covariance matrix between Wi and Wj is 


H 


where is the vector containing the elements of exclud¬ 
ing Vi, and we define 


’d’i{r^\x) = $ 


_$ 


^low _ 2- 


Vx - if^ / \<Xn+Vx- 

(A.12) 

Using the joint distribution (A. 11), we can compute the 
asymptotic behavior of (A.8) as 


Ak= E Er„{a?,|H} 

ien„ 

= E 




d'U)jdrKP(r«,'U)j) ( 


= E E I dwiAf{m\0,Vx) 




X ^ n 


(A.9a) 




li{ri\wi) 

Ak E [ ^wAf{w\d,Vx)di(r''^'‘'‘ 




w 


and, similarly, 

Bk-> A„ ^ f dwJ\f{w\0,Vx)'i>(r 


(A.9b) 


(A.9c) 


rGTZri 


!'(r|u)) 
i!(r|ru) 
(A.13) 

•"(r|u;) 


l{r\w) 

(A. 14) 


^In the derivation, we use the fact that the product of two Gaussians 
provides another Gaussian, i.e., M{x\a, A)M{x\b, B) = ZAf{x\c,C), 
where c = C{A-^a C = (A"! + and Z = 

1 


^27r(A+S) 


, (a —6) 

i “ 2(A+B) 
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Note that and are asymptotically independent on the 
indexes i,j in the large system limit. We also drop the index 
i of 'i>i{ri\x) defined in (A. 12). Moreover, we notice that 
defined in (A.2) become independent of the index 
i, and read as 

S(r|w) = J dwPout{r\w)e , (A.15) 

where w and Vw are the asymptotic limits of Wi and v^i, 
respectively, and Vu, can be expressed as 


Substituting (A.21) into (A.9) and (A. 16), the parameters 
Vx, Cx, Vx, and Vxx can be obtained, which are 



x^3^{x\x, z)dx 
x^^{x\x, z)dx 
f x3^{x\x,z)dx 



(A.22a) 

(A.22b) 

(A.22c) 

(A.22d) 


'^ijVxj 


H 


= e|E^? 


— • 


(A.16) 


Together with these results, (A. 13) and (A. 14) can be further 
simplified as 


A 


K 


B 


K 



Thus, we obtain the asymptotic behavior of 1/usj in (A.6) as 
— ^A-B = E, (A.18) 

Vs - 

Sj 

where A = ^ ~ Sk with A„ and B^ given in 

(A. 17). Note that l/vg- becomes asymptotically independent 
of the index j. 

Next, we consider the asymptotic behavior of sj given 
in line 6 of Algorithm 1. Following the similar argument 
presented in this paper, we can prove that Sj conditioned on 
Xj is asymptotic Gaussian. Again, we drop the index j in the 
large system limit, and we show that the asymptotic mean and 
variance of s is ^ ■ x and ^ with D = D^, where 


Dk — A„ E! 

reKs 



^{r\ydviu) ’ 


(A.19) 


and 'I''(r|t6) represent the derivatives of T* with respect to u. 
Therefore we denote 


s = 



(A.20) 


where z ~ A/'(0,1) and x ~ P\n{x). Using (A.18) and (A.20), 
the marginal posterior (12) can be rewritten as 


Pin (x) exp — 


2 — 
^ IT 


[x\x,z) = 


f Pin (x) exp -- 


D ■ v/Az: 


(A.21) 


2 — 
^ rp 


da: 


Finally, we conclude that the asymptotic behavior of Sj 
in Algorithm 1 can be characterized by (A.20), where the 
parameters A,D,E are determined by (A.17) and (A.19) 
in conjunction with the parameters Vx,Cx,Vx,Vxx given in 
(A.22). For the complex-valued case, the signal power of 
the real or imaginary part is 1/3/2. Therefore, the equivalent 
quantization interval should multiply a factor of 3/2 as in (21). 
Besides, the relative scalar multiplication should be modified 
to complex multiplication as in (20) and (22). 
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